TRECVID 2005 Experiments at Johns Hopkins University: Using Hidden Markov Models for Video Retrieval
نویسندگان
چکیده
The use of hidden Markov models (HMMs) for the high-level feature detection task of TRECVID 2005 is described. Highlevel features present in a keyframe are assumed to constitute the state-space of a Markov chain. The observed visual features of the keyframe, as well as the text accompanying the keyframe, are modeled as stochastic emissions from (unobserved) states of this Markov chain. Manual annotations of shots for the presence of these high-level features, as provided by NIST, constitute the data from which parameters of the HMM are estimated. The estimated HMM enables computation of the a posteriori probability that a particular high-level feature is present in a keyframe in the test collection, given the visual features of the keyframe and its accompanying closed-caption or transcribed text. It is demonstrated that different subsets of the set of visual and text features, and slightly different HMM-settings, are optimal for detecting different high-level features. Finally, it is demonstrated that the posterior probabilities of highlevel features in a test keyframe, computed by the HMMs, may themselves be used as inputs to a support vector machine to further improve detection performance. Results for high-level feature detection are presented on a held-out portion of the manually annotated NIST corpus, as well as the TRECVID 2005 video search collection.
منابع مشابه
Imperial College and Johns Hopkins University at TRECVID
We describe our experiments for the high-level feature extraction and search tasks. For the search task, we tested the system we have used in previous years, which encapsulates content based image search, image browsing, automated image annotation and named entity extraction. For the feature task we apply the nonparametric density estimation model and the HMM-based concept specific image model.
متن کاملThe JHU/APL HAIRCUT System at TREC-8
The Johns Hopkins University Applied Physics Laboratory (JHU/APL) is a second-time entrant in the TREC Category A evaluation. The focus of our information retrieval research this year has been on the relative value of and interaction among multiple term types and multiple similarity metrics. In particular, we are interested in examining words and n-grams as indexing terms, and vector models and...
متن کاملNotebook Paper Brno University of Technology at TRECVid 2013 Interactive Surveillance Event Detection
In the paper, we describe our experiments in the interactive surveillance event detection pilot (SED) of the 2013 TRECVid evaluation [13]. Our approach inherits functionality of the Surveillance Network Augmented by Retrieval Event Detection (SUNAR-ED) system, which is an information retrieval based wide area (video) surveillance system being developed at Faculty of Information Technology, Brno...
متن کاملIRIT @ TRECVid 2010 : Hidden Markov Models for Context-aware Late Fusion of Multiple Audio Classifiers
This notebook paper describes the four runs submitted by IRIT at TRECVid 2010 Semantic Indexing task. The four submitted runs can be described and compared as follows: • Run 4 – late fusion (weighted sum) of multiple audio-only classifiers output • Run 3 – context-aware re-rank of run 4 using hidden Markov model • Run 2 – context-aware late fusion of multiple audio classifiers output with hidde...
متن کاملTRECVID 2004: Video Search Experiments at IUB
The experiments presented in this paper explore topics surrounding video information retrieval (IR). This paper will discuss in detail our participation at TRECVID 2004. A video retrieval system named ViewFinder was developed to search and browse the TRECVID 2004 test data, and both manual and interactive search experiments were carried out. Each of the performed search experiments were in agre...
متن کامل